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Abstract 

Template-based  automatic  target  recognition 
(ATR)  algorithms  such  as  the  Synthetic  Aperture 
Radar  Target  Location  and  Recognition  System 
(STARLOS)  algorithm  typically  use  separate  tem¬ 
plates  to  represent  target  signatures  for  ranges  of 
articulations,  aspect,  depression,  and  squint  angles. 
There  is  a  performance  tradeoff  between  ATR 
accuracy  and  the  number  of  templates  used.  We 
use  a  hybrid  model/template  with  target  models  to 
augment  a  small  set  of  target  templates.  The  basic 
idea  will  be  to  determine  the  transformation  or 
perturbations  required  to  modify  a  given  template 
so  that  it  accurately  represents  the  signature  of  a 
neighboring  sensor  geometry  or  target  articulation. 
By  incorporating  a  model  for  these  perturbations 
into  the  ATR  algorithm,  we  can  reduce  the  total 
number  of  templates  required  and  provide  robust¬ 
ness  to  new  collection  geometries,  obscuration,  and 
articulation. 

1.  Introduction 

Two  of  the  major  thrusts  in  the  ATR  area  are  in 
model  based  vision  (MBV)  and  template  match¬ 
ing  (TM).  Numerous  algorithms  have  been  devel¬ 
oped  following  these  two  paradigms.  Applications  of 
these  ATR  approaches  include  the  DARPA  Moving 
and  Stationary  Target  Acquisition  and  Recognition 
(MSTAR)  program  which  uses  the  MBV  approach, 
and  the  DARPA  Semi  Automated  Image  Process¬ 
ing  (SAIP)  and  Army  STARLOS  programs  which 
employ  a  TM  approach. 

Both  algorithm  paradigms  address  some  serious 
difficulties  in  recognition  of  targets  in  high  clutter 
from  synthetic  aperture  radar  (SAR)  data.  SAR 
signatures  are  nonliteral  and  change  very  rapidly 
as  a  function  of  sensor /target  geometry.  Other  ef¬ 


fects,  such  as  layover,  obscuration,  and  concealment 
through  nets,  etc.,  only  serve  to  complicate  matters. 

The  overall  goal  of  each  paradigm  is  identical: 
associate  a  prescribed  hypothesis  concerning  target 
type  and  state  to  a  received  piece  of  data.  We  will 
have  N  hypotheses  Ho,  H\7 . . . ,  Hn} . . . ,  Hjy.  The 
hypotheses  under  test  are 

Ho  :  clutter, 

Hn  :  target  in  state 

The  “state”  of  the  target  is  0^= [target  class  k, 
relative  geometry  <j>i}  target  articulation  /?m,  and 
amount  of  obscuration  ^n].  The  variable  n  is  a  four¬ 
dimensional  vector  index  in  state  space  [fc,/,m,n]. 
Examples  of  target  classes  are  k=[ Ml  tank,  T72 
tank,  M35  truck,  etc.].  Relative  geometry  is  based 
on  target  pose  (aspect)  and  collection  geometry 
(squint,  depression  angles).  The  target  articula¬ 
tion  is  based  on  the  state  of  doors,  hatches,  and 
relative  angles  of  turrets,  trailers,  and  other  recon- 
figurable  target  components.  Finally,  obscuration 
accounts  for  shadowing  and  layover  elfects  from  ad¬ 
jacent  clutter  such  as  tree-lines,  and  some  deliberate 
camouflage  and  concealment  techniques. 

The  hypothesis  chosen  is  based  on  the  minimum 
distance,  d7  between  a  set  of  features 
1{X)  =  [fi{X),f2{X),...,fM(X)]t  derived  from 
the  data  under  test,  X}  and  a  reference  function 
vector  Rn 

min  d(l(X),Bn)<T  (1) 

II  TV 

Each  reference  function  vector  is  tied  to  a  spe¬ 
cific  hypothesis  Hn-  That  reference  function  vec¬ 
tor  which  is  closest  to  the  input  data  and  is  “close 
enough”  (below  a  threshold,  7)  determines  which 
hypothesis  the  algorithm  reports. 

For  the  template-based  approach,  we  would  use 
representative  input  data  to  form  a  representative 
exemplar  (i.e.,  f{X)  =  X).  The  reference  function 
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would  then  be  a  spatial  gray-level  template.  Con¬ 
versely,  the  model-based  approach  uses  features  de¬ 
rived  from  an  abstract  target  representation.  The 
reference  function  vector  in  this  case  consists  of  dis¬ 
crete  representative  features  derived  from  a  mathe¬ 
matical  target  signature  model  which  are  matched 
against  corresponding  features  computed  from  the 
input  sensor  data. 

Both  paradigms  offer  a  coarse-to-fine  approach 
to  finding  the  best  hypothesis  for  the  data.  Both 
approaches  have  a  similar  front-end:  a  prescreen¬ 
ing  and  indexing  stage.  The  prescreening  stage  at¬ 
tempts  to  differentiate  between  Ho:  clutter  only} 
from  the  hypothesis  Htgt :  target  predent .  Note  that 
Hi,  Similarly,  the  indexer  subdi¬ 

vides  the  hypothesis  space  Htgt  into  coarse  sub¬ 
spaces  H*?d**  such  as  “tank  at  orientation  30°*  — 
40V?  This  discussion  addresses  the  operations  after 
the  indexing  stage. 

Each  paradigm  has  unique  strengths  and  weak¬ 
nesses,  along  with  philosophical  conflicts.  The 
model-based  approach  accommodates  obscuration 
and  articulation  through  the  models  of  the  tar¬ 
gets.  This  allows  for  a  finer  grained  sampling  of  the 
hypothesis  space  of  interest.  The  template-based 
approach  is  based  on  actual  collected  signatures. 
There  is  no  question  as  to  whether  the  collected  data 
accurately  portray  the  signature  of  interest.  How¬ 
ever,  the  template-based  approach  cannot  densely 
sample  the  entire  hypothesis  space  of  interest.  We 
propose  a  hybrid  approach  that  combines  the  best 
features  of  both. 


Figure  1.  MSTAR  model-based  algorithm 
paradigm. 


tract,  Match,  and  Search  (PEMS)  modules.  The 
PEMS  modules  perform  the  HAT  functions.  The 
Search  module  controls  the  PEMS  loop  by  accumu¬ 
lating  evidence  and  controlling  the  reasoning  pro¬ 
cess.  Search  invokes  the  extract  module,  which  ex¬ 
tracts  features  of  interest  from  the  input  data,  and 
the  prediction  module,  which  predicts  the  value  of 
the  same  feature  set  for  that  target.  The  match 
module  provides  the  computation  of  the  distance 
measure  between  the  extracted  and  predicted  fea¬ 
tures  and  also  the  measure  of  uncertainty.  The  ob¬ 
jective  function  is  the  likelihood  function  of  an  ex¬ 
tracted  feature  set  conditioned  on  a  specific  target 
type/pose  [3]. 


2.  Model-Based  Approach 

A  model-based  ATR  recognizes  targets  by  match¬ 
ing  features  extracted  from  the  unknown  signa¬ 
ture  against  predictions  of  those  features  generated 
from  mathematical  models  of  the  sensing  process 
and  candidate  target  geometries  [1].  Note  that  the 
matching  is  done  on  a  feature  level  which  may  or 
may  not  encompass  the  actual  signature  of  the  tar¬ 
get  itself.  Examples  of  features  can  be  found  in 
the  literature  [2].  The  backbone  of  the  model-based 
paradigm  is  a  hypothesis- and-test  (HAT)  approach 
whereby  a  target  type/state  is  hypothesized,  the  ap¬ 
propriate  feature  set  extracted  from  models,  the  fea¬ 
tures  matched,  and  a  likelihood  score  determined. 
This  process  is  iterated  until  the  hypothesis  with 
the  highest  likelihood  is  decided  upon. 

The  overall  process  flow  of  the  MSTAR  model- 
based  ATR  algorithm  is  shown  in  figure  1.  The 
heart  of  the  MSTAR  algorithm  is  the  Predict,  Ex- 


3.  Template-Based  Approach 

In  the  template-based  approach,  a  set  of  templates 
is  produced  from  collected  data  and  compared  to  the 
input  data.  These  templates  are  representative  spa¬ 
tial  signatures  of  the  given  target  hypothesis.  The 
representative  spatial  signature  can  be  a  sample  of 
the  signature  set  at  a  specific  geometry  /state,  or  a 
linear  combination  of  registered  signatures  over  a 
range  of  geometries/states.  The  STARLOS  algo¬ 
rithm,  for  example,  uses  a  decision  metric  related 
to  a  mean  square  fitting  error  [4,  5],  where 

,  x  __  - _ 1_  V-'  f  aX(i}j)  —  P'njhj)  \ 

l)3 

(2) 

X(i,j)  is  the  image  chip  under  test,  and  _R„  = 
[p.„ ,  (TnJ .  Hn(i,  j)  is  a  “mean  image”  indexed  with 
respect  to  target  class  and  geometry,  cr„  (i,  j )  is  the 
corresponding  target  signature  spatial  standard  de- 


Figure  2.  Template-based  algorithm  paradigm. 


viation,  and  M  is  the  number  of  pixels  in  the  target 
chip.  The  template  that  best  represents  the  data 
and  has  enough  evidence  of  a  match  selects  hypoth¬ 
esis  n.  Figure  2  shows  the  processing  flow  for  the 
template-based  approach. 

The  mean  image fin(h  j)  is  produced  by  linearly 
combining  registered  signatures  of  a  particular  tar¬ 
get  over  a  local  set  of  collection  geometries.  Cur¬ 
rently,  the  emphasis  has  been  on  combining  signa¬ 
tures  over  local  target  aspect.  This  is  done  to  reduce 
the  number  of  templates  over  which  tests  must  be 
made.  Similarly,  the  target  signature  spatial  stan¬ 
dard  deviation  cr^  (f,  j)  is  formed  by  analyzing  the 
pixel-by-pixel  signature  variability. 


Figure  3.  Sampling  of  the  hypothesis  space  for  the 
template-based  and  model-based  methods. 


%™  —  {Hn  :  have  templates  fin},  %MBV  —  { H „  : 
can  generate  model-based  reference  functions  i?a}, 
and  %  is  the  space  of  all  possible  hypotheses. 

jjTM  r^MBV  ^ 

The  difference  in  sampling  of  the  hypothe¬ 
sis  space  suggests  two  alternatives  for  a  hybrid 
model/template  approach. 

4.1.  TM  for  Fine  Indexing 

This  approach,  shown  in  figure  4,  inserts  the  TM 
algorithm  as  another  indexing  stage  to  further  re¬ 
fine  the  hypothesis  estimates  before  the  application 
of  the  MBV  algorithm.  Refined  hypotheses  are  ob¬ 
tained  through  the  template  set.  This  reduces  the 
number  of  hypotheses  over  which  the  MBV  algo¬ 
rithm  must  search.  The  drawback  for  this  approach 
is  that  a  large  template  set  must  still  be  used.  The 
added  computation  of  this  insertion  may  offset  the 
advantage  of  not  having  to  search  over  a  large  hy¬ 
pothesis  space. 


4.  Hybrid  Model/ Template  Approaches 

The  granularity  of  potential  hypothesis  space  helps 
to  differentiate  between  TM  and  MBV.  Because 
of  the  requirement  to  generate  a  computationally 
manageable  set  of  discrete  hypotheses  while  span¬ 
ning  the  large  target /pose/articulation/obscuration 
space,  the  TM-based  approaches  must  perform  a 
coarser  sampling  of  the  hypothesis  space  than  can 
be  accomplished  with  an  MBV-based  approach. 
TM  approaches  offer  a  computational  advantage, 
however,  in  that  the  templates  are  calculated  off¬ 
line,  stored  in  memory,  and  are  thus  directly  avail¬ 
able  for  application  to  the  decision  statistic.  The 
model-based  approach,  while  providing  a  finer  sam¬ 
pling  of  the  hypothesis  space,  must  reproduce  a 
feature  set  indicative  of  the  hypothesis  prior  to 
application  of  the  decision  statistic.  A  graphi¬ 
cal  description  of  this  is  shown  in  figure  3,  where 


4.2.  MBV  for  Template  Perturbation 

A  more  promising  alternative  is  to  employ  the 
model-based  paradigm  as  a  means  of  altering  the 
templates  (reference  functions)  that  are  applied  to 
the  data.  In  this  approach,  the  templates  would  be 
seen  as  the  core  signature  upon  which  model-based 
distortion  operators  and  signature  changes  would 
be  made.  The  amended  reference  function  would 
become 

g^O^lRj  +  SR^,  (3) 

where  ?V  =  n  +  Sn}  Hn  E  %™ ,  and  Hn E 
% mbv  £  %™ .  A  simple  starting  point  is  to  use 
the  mean  image  template  as  the  reference  function 
(i.e.,  Rn  =  fin).  The  operator  O V  would  predom¬ 
inant  ly"account  for  geometry  and  pose  variations. 
This  could  be  interpreted  as  a  “distortion  opera¬ 
tion.”  The  term  SRn,  changes  the  signature  based 


5.  Discussion 


Figure  4.  Straightforward  hybrid  model/ template 
approach  where  the  template  approach  is  used  as 
a  fine  indexing  stage. 


on  articulation,  obscuration,  and  geometry  consid¬ 
erations.  On  would  be  stored  and  called  as  required, 
permitting  reference  signature  modification  via  the 
model-based  predictor. 

An  interesting  aspect  of  this  construction  is  the 
case  where  fine  internal  detail  helps  determine  which 
hypotheses  to  pursue.  The  template  would  be  seg¬ 
mented  into  “stable”  and  “unstable”  regions  (a  no¬ 
tion  similar  to  the  chunky  template  approach  [5])  as 
a  function  of  target  state.  The  | X(i7j)  — 
term  in  equation  (2)  can  rapidly  determine  the  spa¬ 
tial  locations  of  poor  template  fits.  The  hypotheses 
which  provide  SprA}yj)  over  the  local  area  are  now 
searched. 

As  an  example,  if  a  template  mismatch  oc¬ 
curs  in  areas  of  known  signature  articulation  de¬ 
pendence,  generate  model-based  deformations  con¬ 
sistent  with  articulation  hypotheses.  Alternatively, 
if  the  template  signature  mismatch  occurs  over  pos¬ 
tulated  stable  areas,  the  target  class  may  need  to 
be  amended  or  specific  localized  scatterers  (e.g.,  a 
removable  external  store)  added  or  subtracted  using 
model-based  deformation  to  provide  best  fit. 

The  advantage  of  this  approach  is  that  the  num¬ 
ber  of  potential  templates  required  may  be  reduced 
and  the  model-based  search  would  amend  the  tem¬ 
plates  and  fill  in  the  gaps.  In  addition,  the  model- 
based  approach  would  need  to  predict  only  the  areas 
where  the  signature  has  changed,  not  the  full  signa¬ 
ture,  thereby  reducing  computational  cost. 


We  have  proposed  a  hybrid  template/model-based 
approach  for  ATR  which  we  believe  has  potential  to 
draw  upon  the  strengths  of  both  paradigms.  One  of 
the  major  stumbling  blocks  for  any  attempt  to  cre¬ 
ate  a  hybrid  template/model-based  system  is  the 
veracity  of  the  models  themselves.  This  problem  is 
exacerbated  by  the  notion  that  the  model  must  now 
interact  directly  with  the  template  in  our  construc¬ 
tion.  The  other  large,  and  unsolved,  issue  is  the  no¬ 
tion  of  scatterer  correspondence  between  the  model 
generated  signature  and  the  collected  template. 

An  experiment  is  briefly  proposed.  Using  a  set 
of  templates  derived  from  collected  data  which  are 
well  ground  truthed  with  respect  to  the  specific  hy¬ 
pothesis:  (1)  test  the  templates  using  MBV  to  see 
if  MBV  declares  the  correct  hypothesis;  (2)  if  not, 
amend  the  CAD  model  until  correct  declaration;  3) 
if  yes,  compare  the  predicted  model  based  signature 
to  the  template  itself;  4)  amend  the  CAD  model  un¬ 
til  the  signatures  closely  match.  If  this  experiment 
is  successful,  assume  that  the  model  will  correctly 
predict  signatures  for  the  finer  set  of  hypotheses. 
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